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Abstract This paper proposes a family of estimators of population mean using information on 
several auxiliary variables and analyzes its properties in the presence of measurement errors. 


Introduction 
The discrepancies between the values exactly obtained on the variables under 
consideration for sampled units and the corresponding true values are termed 
as measurement errors. In general, standard theory of survey sampling 
assumes that data collected through surveys are often assumed to be free of 
measurement or response errors. In reality such a supposition does not hold 
true and the data may be contaminated with measurement errors due to 
various reasons (see, for example Cochran (1963) and Sukhatme et al. (1984)). 
One of the major sources of measurement errors in survey is the nature of 
variables. This may happen in case of qualitative variables. Simple examples of 
such variables are intelligence, preference, specific abilities, utility, 
aggressiveness, tastes, etc. In many sample surveys it is recognized that 
errors of measurement can also arise from the person being interviewed, from 
the interviewer, from the supervisor or leader of a team of interviewers, and 
from the processor who transmits the information from the recorded interview 
on to the punched cards or tapes that will be analyzed, for instance, see Cochran 
(1968). Another source of measurement error is when the variable is 
conceptually well defined but observations can be obtained on some closely 
related substitutes termed as proxies or surrogates. Such a situation is 
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encountered when one needs to measure the economic status or the level of 
education of individuals, see Salabh (1997) and Sud and Srivastava (2000). In 
the presence of measurement errors, inferences may be misleading, see Biermer 
et al. (1991), Fuller (1995) and Manisha and Singh (2001). 

There is today a great deal of research on measurement errors in surveys. 
An attempt has been made to study the impact of measurement errors on a 
family of estimators of population mean using multiauxiliary information. 


The suggested family of estimators 

Let Y be the study variate and its population mean ¡y to be estimated using 
information on p(> 1) auxiliary variates X;,X»5,...,X;. Further, let the 
population mean row vector wW = (p, M2", up) of the vector 
Xx = (X 1, Xs X n Assume that a maple random sample of size n is drawn 
from a population, on the study character Y and auxiliary characters 
X1, X»,..., Xp. For the sake of simplicity we assume that the population is 
infinite. The recorded fallible measurements are given by: 


y = Y; t Ej 
Xij = X; + Nj, = 1,24 b, 


_ 


where Y; and X; are correct values of the characteristics Y and 
X0 21,2,..4 5: = 1,.2....,m0). 

For the sake of simplicity i in exposition, we assume that the error Es are 
stochastic with mean “zero” and variance ory and uncorrelated with Ys. The 
errors mj in Xy are distributed independently of each other and of the X; with 
mean “zero” and variance oy 2¢ =1,2,...,p). Also Es and Ts are 
uncorrelated although Y;s and Xjs are correlated. 


Define 
i 
ui = e (¿=1,2,**",p), 
uT = (uus, Up) xp: 
pls T 
_ 1X 
y) ig Yj, 
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With this background we suggest a family of estimators of uy as: 


fig = g(y. u^). (2.1) 


839 
where g (y, 17) is a function of y, u, U2,'**, up such that: 
S(uoe T) ks. Ho 
M del: ) 1 
89 lue) 


and such that it satisfies the following conditions: 
* the function g(y, u^) is continuous and bounded in Q; 
* the first and second order partial derivatives of the function g (y, u ) exist 
and are continuous and bounded in Q. 


To obtain the mean squared error of fig, we expand the function g (5, u T) about 
the point (poe?) in a second order Taylor's series. We get: 


ð . 
fig =2 (uo, e^) + (9 — uo) p l l ju -gP (me) | Q3) 
MoeT 
lf, 29g) E rg) 
"al he a» a mee 9 gear) 


Hu — e) g® (ix, ux" )(u — o), 
where: 


y* = po + OF — po), u* = e + Au — e),0 < 0 < lg) 


denote the p element column vector of first partial derivatives of g(-) and g(-) 
denotes a p x p matrix of second partial derivatives of g(-) with respect to z. 
Noting that (uo, e?) = uo, it can be shown that: 


E(ñig) = po + On), (2.3) 


which follows that the bias of ñ, is of the order of n~t, and hence its 
contribution to the mean squared error of fig will be of the order of n ?. From 
(2.2), we have to terms of order z !: 
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= E[G = »o)26 — wo) — o2 mp, e^) 
(a, e!) w= ou — o (aqu, e7))| 
1 
= [wa (C + Cio) + 200710, +(80)"A(810,07) | 
(2.4) 


where bT = (bi, b», .. ., bp), bi, = poiCoC;, (1=1,2,...,p) C; = 05/ m, Cà) = 
o;/ Mj, (1 = 1,2,..., p) and Co = oo/ uo: 


840 


CC POC OCs co py C, 
PCC? Ch Cay po3C2C3 `  papCaC; 
A= | pi3CiC3  paCaCz C Gs == payC3Cp 
pipCiCp PCC PO === CL Ci 

pXp 


The MSE (fig) at equation (2.4) is minimized for 
g” (uoe) = -p4 b. (2.5) 
Thus the resulting minimum MSE of fig is given by: 
min. MSE(fig) = (u4/n) [C + Cty - Ab. (2.6) 


Now we have established the following theorem. 
Theorem 2.1— up to terms of order n7t, 


MSE (fig) = (13/n) [Co + Cty — 5 A^ (2.7) 
with equality holding if: 


g® = -pA Ib. 


It is to be mentioned that the family of estimators fig at equation (2.1) is very 
large. The following estimators may be identified as particular members of the 
suggested family of estimators fig: 


b " b 
aD =) Qj Bp w; = 1, (Olkin, 1958). 


i=1 E i=l 
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if? = D> m (=) , y wi — 1, (Singh, 1967). A family of 
j i=l 


i=l Ps estimators 
b 
Sp b 

1) 2 31... Y” o= 1, (Shukla, 1966; John, 1969). 

peg 2. ( J ) 841 
> o i — 
i=l 


> @;X; b 


je ; > c = 1, (Sahai et al., 1980). 


P wj b 
Lo = y É; .= 1 M h P ik 1 4 
Ë 511 (&) 2. , (Mohanty and Pattanaik, 1984). 


D mA ^ P 
i = y p 2z) >> wi = 1, (Mohanty and Pattanaik, 1984). 
i=l Mi i-l 


b yNO D 
- ag? — »[[ (=) , X_ œ; = 1, (Tuteja and Bahl, 1991). 


i i=1 


-1 
(8) — > P Cj L; P m : 
9-5: , > @ = 1, (Tuteja and Bahl, 1991). 


b " p+1 
if? = y o, |: = 
9 Y | @p+1 P >. 0; (Ë) > Wj 
i=1 i=1 
b x p+l 
BL =3 op +) oi (s) Pm 
al NM] = 
id q " b $ q b = 
a Ub — y [TU D " ] . I 
mes 5 GE x). 
i=1 i=q+1 i=l i=q+1 


(Srivastava, 1965; Rao and Mudholkar, 1967). 


b PN Y 

if? = 5] (=) (ajs are suitably constants) (Srivastava, 1967). 
LI; 
i=1 1 
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af = a] {2 — E y (Sahai and Ray, 1980). 


(Walsh, 1970). 


if = 
y 
Al "s + 0% a uj) 
p = ju. 6log ui} (Srivastava, 1971). 
i=1 
b 
ñ 805 = Pexpí> ` 0;(u; — 1)) (Srivastava, 1971). 
i=] 
b b 
ag = y» cyexp1 (6;/ 0) log uj > w; = 1, Grivastava, 1971). 
i=1 i=1 


b 
. ag =) + > G; (Xi — I); etc. 


The MSE of these estimators can be obtained from equation (2.4). 
It is well known that: 


VO) = (13/m) (C$ + Ci). (2.8) 


It follows from equation (2.6) and equation (2.8) that the minimum variance of 
fig is no longer than conventional unbiased estimator y. 

On substituting To) = =0, oq; = 0Vi = 1,2,...,pin equation (2. 4), we obtain 
the no- ESE B case. In that case, the MSE of fig, is given by: 


MEIS, ; [Cis + 2pob Tg * (uo, 6T) + (g * Quo 67) A (gs 9). 
= MSE(fig*), 
(29) 


where: 
(2.10) 


and YandX; (i — see b) are the sample means of the characteristics Y and 
X; based on true measurements. (Y;, X; 1 = 1,2,...,5;7 = 1,2,...,n). The 
family of estimators fig* at equation (2.10) is a generalized version of 
Srivastava (1971, p. 80). 


Downloaded by Professor Florentin Smarandache At 08:13 27 April 2018 (PT) 


The MSE of fig* is minimized for: 


gx" (uy el) = —A x 1 bpo. (2.11) 
Thus the resulting minimum MSE of fi,* is given by: 
: " p E 
min.MSE(fig*) = = Ë —bTA x7! b (2.12) 
=—(1-R 
^ — p) 


where Ax = [a*;] is a p X p matrix with ax; = p;C;C; and R stands for the 
multiple correlation coefficient of Y on X1, X», .. ., Xp. 

From equations (2.6) and (2.12) the increase in minimum MSE (fig) due to 
measurement errors is obtained as: 


2 
min.MSE (fig) — min.MSE (fx) = (5) [Ci +b7A «1b - 57A 


> 0. 


This 1s due to the fact that the measurement errors introduce the variances 
fallible measurements of study variate Y and auxiliary variates X;. Hence there 
is a need to take the contribution of measurement errors into account. 


Biases and mean square errors of some particular estimators in the 
presence of measurement errors 

To obtain the bias of the estimator fiz, we further assume that the third partial 
derivatives of 2 (5, u 2 also exist and are continuous and bounded. Then 
expanding g(y,47) about the point (y,u^) = (my,e?) in a third-order 
Taylor's series we obtain: 


0 s 
fig = g(uo,e7) + (9 — mo) T l p — à gu e^) 
pose T 
D og) : 
Tore ws - 25 = uo) olg se — og 


+u — e) (guo, eT))(u — e) 
1 à ð 
tal 10) atu DES ue), 


where g0? (uo, e?) denotes the matrix of second partial derivatives of g (y, T ) 
at the point (5,u*) = (uo, e). 
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Noting that: 
(we) = mo 
9g(*) E 
oy (pose T) 
9^g(-) -0 
oy? (10.27) j 


and taking expectation we obtain the bias of the family of estimators ñ to the 
first degree of approximation: 


1: 
B(he) = 5 [Et (¿y yu) +2(5)07 (9,07) 
(3.2) 


where bT = (bi, b», ..., by) with bi = poiCoC;; (i = . D). Thus we see that 
the bias of ñ; depends also on the second e d derivatives of the 
function on gu „u T) at the point (uo, c7), and hence will be different for 
different optimum estimators of the family. 

The biases and mean square errors of the estimators ji; 7 = 1 to 18 up to 
terms of order n~! along with the values of g® (uo, s gÜ (uo, e^) and 
g P (uo, eT) are given in the Table I. 


Estimators based on estimated optimum 

It may be noted that the minimum MSE, equation (2.6), is obtained only when 
the optimum values of constants involved in the estimator, which are functions 
of the unknown population parameters uo, b and A, are known quite accurately. 

'To use such estimators in practice, one has to use some guessed values of the 
parameters uo, b and A, either through past experience or through a pilot 
sample survey. Das and Tripathi (1978, sec. 3) have illustrated that even if the 
values of the parameters used in the estimator are not exactly equal to their 
optimum values as given by equation (2.5) but are close enough, the resulting 
estimator will be better than the conventional unbiased estimator y. For further 
discussion on this issue, the reader is referred to Murthy (1967), Reddy (1973), 
Srivenkataramana and Tracy (1984) and Sahai and Sahai (1985). 

On the other hand if the experimenter is unable to guess the values of 
population parameters due to lack of experience, it is advisable to replace the 
unknown population parameters by their consistent estimators. Let ó be a 
consistent estimator of $ó = A~!D. We then replace $ by $ and also po by y if 
necessary, in the optimum fig resulting in the estimator Ages), say, which will 
now be a function of y, u and $. Thus we define a family of estimators (based on 
estimated optimum values) of uy as: 


= 19 Segoe Ss 
O = & Ë n S= 
>, = 0D SABRES 
TON ScEE 
iz a =o 
S YN 
«q uc 
S CD NO y 
+= uU 9 E 5 
< ° S SE 
mas 
> 
(panuyuod) 
dxd dxd 
É 6 4 9 qa + 0 6 q+ ÑIY, dE 6 ( ï- °) oriz 50 qd 
dxd dxd 
ove 8 qz + s = r AMO ZE 6 _ [ 4.00) i ii: d 
v +6 ,q¿— 5 +f " TRA JL a-— 6 az Ar ot 
in dxd 
k + 6 , qa — “o þau- a mc x ð- ( a+ 0) Onl a o 
EC a E dxd E 
ena el } SUM I 4 me (E Sym (xgeugnu ó ng qr 
Sy, wa | 5,4 5 D 
GST = DGTlto = x51) qus 
E (x deo cos tO x 10) = x@ oou 
oad 1,0 o 0 (&) ms opin @ (2) ur 9 i o me _ 
Ty 70 «9, qe Jo iq Wy xr E NEA ES ot 
dxd 
É 48 +6 qa O 4 5 (z) @ q(ss) 5 mew) ð et od 
o + $97.95 + 29105 + ij = ¿9 MOYA (do :: ` ` ‘ge 10)81p = d x dm aym 
dxd dxd 
k F9 19% E i >] (s) (2.0 10) (ca) o— Mong oon qu 
ASIN seg (120) gyF (120) g,5 (L290) yF — 1oyeumsg 
al z I 


(Ld) 8107 dW LZ €1:80 1V eupepuereurs ugua 1ossəJoiq Áq pəpeoluwoq 


(panuyuod) 


dxd dxd 
BY +o az- 9 + O| (ss) 5,q-Py,B- 5,9 (v) D= x = gee m Ts C 
(40 zo To) Serp = < (Io ^ To To) = > SIAM 
dxd dxd 
SV 55 q+ QO + DO s) S45 55,9799 (G) ge eue LIE 
dxd 7 
0 0 0 0 0 0 
0 0 0 0 0 0 
0 0 by. 0 0 0 dxdCD. 
(0 ZW) = 4 
tg) DIFE) = LAO 
0 0 0 o 0 0 
0 0 0 0 o 
0 0 0 0 0 lol 
(D 
dXxt(dop + THbo Vg Gm—Ia-) = 0 INYM 
(D (D w PA i EJ 
Gy e q QO + (s) ep % Az Ont (Dep qp d 
ove +6 90+ S) T 19 (x) Dala) 6 Ó oo ot 
dxd dxd 
ove F9, qz 9 + %| (25) 6,4- A; (G) g= Ai "tz orn wt 
CON Seq (120 gy (120) 5 (120) (9 — 10Jeunsq 
= 
or © 2 
NS x S 
=o o0 E 


(Ld) 8107 dW LZ £1:80 1v eupepuereurs unuaj JOSS3JOIH Áq pəpeoluwoq 


A family of 
estimators 


847 


Table I. 


(d fep = 100) = !xD qus do 87 


dxd dxd 


o Lo) = xD ƏI9UA 


*Dy 540,024 “O + 29] (7) poserqur] 0 0 xð del 
(r-i) o (15) 0 ep = 9 agm 
OV 0 - 8, qc + 72 + 20| (G) 8,05 *** @ ,2 | (60) 0 Ont Pd Qj gon 7g 
Ov 8 - 8,2 O+ bo (a) 8 ,qz + 8v ,8 | (3) à oo" gn Sy 
dxd dxd 
vð -8, qc + 95 +2 (5) a+  O,2-0V,0 (55) [9 *«9'19)Sep — aym 
dxd 
D | 6 - 88 |f rl ter Y 
0 
BY 545,97 -%9+00| (65) B- Diz ku qi d 
ASIN seg (L120 5 (12:01), 8 (120) pF  10jeunsq 
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Matest) =g xx (5, ut, 7), (4.1) 
where g * *(-) is a function of (y, T, $7) such that: 
0 š 
g* * (uo, 2,97) = po for all ys es RERUM =1 
oy (Boet, pT) (4.3) 
dg * *(-) ag) i I 
Er ou = = oA Ib = — Hod, 
£ imete?) 4 (uy) 
and: 
0 . 
erat) = 0. 
ab (use 7,67) 


With these conditions and following Srivastava and Jhajj (1983), it can be 
shown to the first degree of approximation that: 


MSE (fis) = min. MSE (fig) = (P?) [Ci C — 57A ^0]. 


Thus if the optimum values of constants involved in the estimator are replaced 
by their consistent estimators and conditions (4.2) hold true, the resulting 
estimator fu," will have the same asymptotic mean square error, as that of 
optimum fiz. Our work needs to be extended and future research will explore 
the computational aspects of the proposed algorithm. 
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